On minimality of convolutional ring encoders 
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Abstract — Convolutional codes are considered with code se- 
quences modelled as semi-infinite Laurent series. It is wellknown 
that a convolutional code C over a finite group Q has a minimal 
trellis representation that can be derived from code sequences. 
It is also wellknown that, for the case that 5 is a finite field, any 
polynomial encoder of C can be algebraically manipulated to 
yield a minimal polynomial encoder whose controller canonical 
realization is a minimal trellis. In this paper we seek to extend 
this result to the finite ring case Q — Zpr by introducing a 
socalled "p-encoder". We show how to manipulate a polynomial 
encoding of a noncatastrophic convolutional code over Zpr to 
produce a particular type of p-encoder ("minimal p-encoder") 
whose controller canonical realization is a minimal trellis with 
nonlinear features. The minimum number of trellis states is then 
expressed as p^, where 7 is the sum of the row degrees of the 
minimal p-encoder. In particular, we show that any convolutional 
code over Zpi- admits a delay-free p-encoder which implies the 
novel result that delay-freeness is not a property of the code but of 
the encoder, just as in the field case. We conjecture that a similar 
result holds with respect to catastrophicity, i.e., any catastrophic 
convolutional code over Zpr admits a noncatastrophic p-encoder. 
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I. Introduction 

There exists a considerable body of literature on convolu- 
tional codes over finite groups. In this paper we are interested 
, in trellis representations that use a minimum number of states. 
' Since decoders, such as the Viterbi decoder, are based on trellis 
representations, minimality is a desirable property that leads 
to low complexity decoding. In Sect. VI-D] a minimal 
encoder construction is presented in terms of code sequences 
' of the code, involving socalled "granule representatives", see 
.also lfT6i . This is a powerful method that applies to con- 
'volutional codes over any finite group Q. It is wellknown 
that, for the case that ^ is a field, any polynomial encoder 
of a convolutional code can be algebraically manipulated 
to yield a so-called "canonical polynomial encoder" (left 
prime and row reduced) whose controller canonical realization 
■ yields a minimal trellis representation of the code. This is a 
'fundamental result that is useful in practice because codes 
are usually specified in terms of encoders rather than code 
sequences. In this paper we seek to extend this result to the 
finite ring case Q = Zpr, where r is a positive integer and 
p is a prime integer The open problem that we solve is also 
mentioned in the 2007 paper |23|. We first tailor the concept of 
encoder to the Zpr case, making use of the specific algebraic 
finite chain structure of Zpr. This leads to concepts of "p- 
encoder" and "minimal p-encoder". We then show how to 
construct a minimal p-encoder from a polynomial encoding 
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of the code. The minimal p-encoder translates immediately 
into a minimal trellis realization. Thus our results allow for 
easy construction of a minimal trellis representation from a 
polynomial encoding and parallel the field case. 

Convolutional codes over rings were introduced in flT\, lITSi 
where they are motivated for use with phase modulation. In 
particular, convolutional codes over the ring Zj\/ are useful for 
Af-ary phase modulation (with M a positive integer). By the 
Chinese Remainder Theorem, results on codes over Zpr can 
be extended to codes over Z^f, see also |fT9l . |[T], ||2l, ||9l- 

Most of the literature on convolutional codes over rings adopts 
an approach in which code sequences are semi-infinite Laurent 
series [6|, LIJJ, [15], L16J, [9J, H, US, ||26l. In order to make 
a connection with this literature, we adopt this approach in our 
definition of a convolutional code: a linear convolutional code 
C of length n over Zpr is defined as a subset of (Z"r)^ for 



which there exists a polynomial matrix G{z) G 
that 



such 



C = {ce {Z'^rf I 3 M e (Z^.) : c = uG{z) and 

supp u C [N, 00) for some integer N}. (1) 

Here supp u denotes the support of u, i.e., the set of time- 
instants t e Z for which u{t) is nonzero. Further, z denotes 
the right shift operator zu{t) = u{t — 1). Clearly, ([TJ implies 
that C is linear and shift-invariant with respect to both z and 
z^^. If the matrix G{z) has full row rank then G{z) is called 
an encoder of C. 

For the field case any linear convolutional code admits a 
left prime polynomial encoder, i.e., an encoder that has a 
polynomial right inverse. Such an encoder G{z) gives rise to 
the following two properties: 

1) delay -free property: for any N E Z 



supp c C [N, 00) = 
2) noncatastrophic property: 
supp c is finite = 



stipp u C [N, 00) 



supp u is finite, 



where c = uG{z). Clearly, in the field case, "delay-free"- 
ness and "catastrophicity" are encoder properties, not code 
properties. For the ring case, however, there are codes that 
do not admit a noncatastrophic encoder. For example (see ||6l, 
[21J. [4J) the convolutional code over Z4 with encoder G{z) = 
[1 -I- z 1 + 3z] does not admit a noncatastrophic encoder. 
Similarly, the rotationally invariant convolutional code over Z4 
with encoder G{z) — [3 -|- 3z -I- 3z^ 3 + z + z'^] does not 
admit a noncatastrophic encoder. The reader is referred to ifTSi 
for motivation and characterization of rotationally invariant 
codes over rings. Further, there are codes that do not admit 
a delay-free encoder. For example (see ifTSl . lfT6l . H) the 



convolutional code over Z4 with encoder G{z) = [2 2 + z] 
does not admit a delay-free encoder Note that some codes 
over Zpr do not even admit an encoder, for example over Z4 
the code given by ([T]i with 



The literature (see e.g. [4, subsect. V-C]) has declared the 
properties of "delay-free" and "catastrophic" to be properties 
of the code rather than the encoding procedure. By resorting to 
a particular type of polynomial encoder, named "p-encoder", 
we show in section |III] that delay-freeness is not a property 
of the code but of the encoding procedure, just as in the field 
case, see also {\T\. We conjecture that the same is true for 
catastrophicity. To support this argument, in section |IV] we 
examine specific catastrophic convolutional codes over Zpr 
and show that a noncatastrophic p-encoder exists for these 
examples. 

A more recent approach ||22l (see also Q, ll23l ) to convo- 
lutional codes focuses on so-called "finite support convolu- 
tional codes" in which the input sequence u corresponds to 
a polynomial. Thus the natural time axis is Z+ and both 
input sequences and code sequences have finite support. Finite 
support convolutional codes are, by definition, noncatastrophic 
(Property 2 above) and can be interpreted as sub modules of 
Zpr[z]. For n = 1 connections can be made with polynomial 
block codes. For more details the reader is referred to our 
paper ifTTl . 

II. Preliminaries 

A set that plays a fundamental role throughout the paper is 
the set of "digits", denoted by = {0, 1, . . . ,p — 1} C Zpr. 
Recall that any element a G Zpr can be written uniquely 
as a — 9o + Oip + • • • + 9r-ip^^^, where 9^ £ Ap 
for £ = 0, . . . , r — 1 (p-adic expansion). This fundamental 
property of the ring Zpr essentially expresses a type of linear 
independence among the elements 1, p, p^, p''"^. It leads to 
specific notions of "p-linear independence" and "p-generator 
sequence" for modules in Zpr, as developed in the 1996 
paper [,24J . For example, for the simplest case n = 1, the 
elements 1, p, p^, p^^^ are called "p-linearly independent" 
in II24I and the module Zpr = span {1} is written as 
'^P^ = p— span {l,p,p^, . . . jP"^^^}. The module Zpr is said 
to have "p-dimension" r. 

In this section we recall the main concepts from ifTSl on mod- 
ules in Zpr [2;], that are needed in the sequel. We present the 
notions of p-basis and p-dimension of a submodule of Zpr[z], 
which are extensions from |24|'s notions for submodules of 
Zpr . From (13] we also recall the concept of a reduced p-basis 
in Zpr [z] that plays a crucial role in the next section. 

Definition II.l. [13| Let {vi{z), . . . ,v„^{z)} c Z^r[z]. 
A p-linear combination of vi{z), . . . ,Vm{z) is a vector 

m 

'^^aj{z)vj{z), where aj{z) G Zpr[z] is a polynomial with 

coefficients in Ap for j — l,...,m. Furthermore, the set 
of all p-linear combinations of vi{z), . . . ,Vjn{z) is denoted 



by p-span{vi{z), . . . ,Vm{z)), whereas the set of all linear 
combinations of vi{z), . . . ,w,„(z) with coefficients in Zpr[z] 
is denoted by span (ui(z), . . . ,Vm{z)). 

Definition II.2. |13| A sequence {vi{z), . . . ,Vmiz)) of 
vectors in Z^V [z] is said to be a p-generator sequence 

if pv,n{z) ~ and pvi{z) is a p-linear combination of 

Vi+i{z), . . .,V.miz) for i = 1, . . . ,TO - 1. 

The next lemma is a straightforward result that is used in 
section Hill 

Lemma IL3. Let {vi{z), . . . ,Vm{z)) be a p-generator se- 
quence in Zpr[z]. Then (ui(0), . . . , ^^(O)) is a p-generator 
sequence in Zpr. 

Theorem IL4. EHS Let vi{z), . . . ,Vm{z) G Z;jr[z]. // 
(wi(z), . . . , Vm{z)) is a p-generator sequence then 

p-span (ui(z), . . . ,Vra{z)) = span (wi(z), . . .,Vra{z)). 

In particular, p— span . . . is a submodule of 

Z«r[z]. 

Definition II.5. 1131 The vectors vi{z),..., v„-,{z) G Z^V [z] 
are said to be p-linearly independent if the only p-linear 
combination of ui(z), . . . , v„i{z) that equals zero is the trivial 
one. 

Definition II.6. Let M be a submodule of Zpr[z], written as 
a p-span of a p-generator sequence (wi(z), W2(z), • ' ' ^ Vm{z)). 
Then (wi(z), W2(z), • • • ,Wm(z)) is called a p-basis for M 
if the vectors vi{z), . . . , Vm{z) are p-linearly independent in 

Z^r[z]. 

Lemma II.7. [13] Let M be a submodule ofZpr-[z] and let 

{vi{z), V2{z), • • • , Vm{z)) be a p-basis for M. Then each vec- 
tor of M is written in a unique way as a p-linear combination 

0fVi{z), . . . ,Vm{z). 

All submodules of ZpV [z] can be written as the p-span of a p- 
generator sequence. In fact, if M — span (51(2:), . . . ,gk{z)) 
then M is the p-span of the p-generator sequence 

{9i{z),P9i{z), ■ ■ ■ ,p''~^9i{z), ■ ■ ■,gk{z), ■ ■ ■ .p^'^gkiz)). 

Next, we recall a particular p-basis for a submodule of 
Zpr[2;], called "reduced p-basis". We first recall the concept 
of "degree" of a vector in Zprlz], which is the same as in the 
field case. 

Definition II.8. Let v{z) be a nonzero vector in Z^r [z], written 
as v{z) = Vf) + viz + • • • + Vdz"^, with Vi G Zpr, i = 0, . . . , d, 
and Vd ^ 0. Then v{z) is said to have degree d, denoted 
by dcg v{z) = d. Furthermore, Vd is called the leading 
coefficient vector of v{z), denoted by w'^. 

In the sequel, we denote the leading row coefficient matrix of 
a polynomial matrix V{z) by t/'""^. A matrix V{z) is called 
row-reduced if V^^"^ has full row rank. 



Lemma II.9. 051/ Let M be a submodule ofZpr[z], written 
as a p-span of a p-generator sequence (wi (z), . . . , 
with t)^^,...,!!^ p-linearly independent in Z^^. Then 
(di(z), . . . , Vm{z)) is a p-basis for M. 



Definition 11.10. flSl Let M be a submodule of Z^. [z], writ- 
ten as a p-span of a p-generator sequence (fi(z), . . . , 
Then (i'i(z), . . . ,Vm{z)) is called a reduced p-basis for M 



if the vectors v-^ 



are p-linearly independent in Z" 



A reduced p-basis in Zpr[z] generalizes the concept of row 
reduced basis from the field case. Moreover, it also leads 
to the predictable degree property and gives rise to several 
invariants of M, see ifTSl . In particular, the number of vectors 
in a reduced p-basis as well as the degrees of these vectors 
(called p-degrees), are invariants of M. Consequently, their 
sum is also an invariant of M. 

Every submodule M of Zpr[2:] has a reduced p-basis. A 
constructive proof is given by Algorithm 3.11 in |13| that 
takes as its input a set of spanning vectors and produces a 
reduced p-basis of M. It is easy to see that if the input is 
already a p-basis of m vectors, then the algorithm produces a 
reduced p-basis of again m vectors. Since m is an invariant 
of the module, it follows that all p-bases of M have the same 
number of elements. As a result, the next definition is well- 
defined and not in conflict with the shghtly different definition 

of El. 



Definition 11.11. The number of elements of a p-basis of a 
submodule M of '^pr[z] is called the p-dimension of M, 
denoted as p— dim (Af). 



In recent work 1 14) it is shown that computational packages 
for computing minimal Grobner bases can be used to construct 
a minimal p-encoder 

III. Minimal trellis construction from a 

p-ENCODER 

Formally, we define a trellis section as a three-tuple X = 
{lipr, S, K), where S is the trellis state set and K is the set 
of branches which is a subset of 5' x Z^,- x S, see also [S], 
llT6l . A trellis is a sequence X — {Xt}tei, of trellis sections 
Xt = {Zpr , S, Kt). A path through the trellis is a sequence 
(••• ,bt-i,bt,bt+i,- ■ ■ ) of branches bt = (sf,ct,st+i) e Kt 
such that bt+i starts in the trellis state where bt ends for 
t G Z. The set of all trellis paths that start at the zero state 
is denoted by tt{X). The mapping A : Tr{X) ^-+ C^pr)^ 
assigns to every path (• • • ,bt-i,bt,bt+i, ■ ■ ■) its label se- 
quence (• • • , Ct-i, Ct, Ct+i, • • • ). A trellis X is called a trellis 
representation for a convolutional code C if C = X{tt{X)). 

A trellis representation X for a convolutional code C is called 
minimal if the size of its trellis state set S is minimal among all 
trellis representations of C. It is wellknown how to construct a 
minimal trellis representation in terms of the code sequences 
of C. In fact, the theory of canonical trellis representations 
from the field case carries through to the ring case, see |25J, 
Il6l, lfT6l . Since it plays a crucial role in the proof of our main 



result, we recall the definition of canonical trellis in Appendix 
A. 

Let us recall the wellknown controller canonical form. Let TZ 
be a ring. A matrix E{z) G 7^'*^"[z] is realized in controller 
canonical form ifTOl (see also |I5] Sect. 5]) as 

-1 , 



E{z) Biz-'I ~ A) C + D, 



(2) 



as follows. Denoting the i'th row of E{z) by ei{z) = 
J2'e=o ^i-t'^^' where Ci^e G TZ^^" and Ci^Si 0, the matrices 
A, B, C and D in ^ are given by 



A 



Ai 



A. 



B 



Bi 



B, 







ei,o 















c = 



where Ai is a (5^ x bi matrix, is a 1 x 8i matrix and Ci is 
a (5i X 1 matrix, given by 



A, 



1 



1 

_ 

ei,i 



B, 



[1 



for i — 



(3) 



Whenever Si — 0, the ith block in A as well as C is absent and 
a zero row occurs in B. Denoting the sum of the (5i's by 5, it 
is clear that A is a (5 x (5 nilpotent matrix. The above controller 
canonical realization can be visualized as a feedforward shift- 
register with (5 registers. 

In the case that 7^ is a field with q elements it is wellknown fSl, 
ll76l how to obtain a minimal trellis representation for C from 
a polynomial encoder. For this, the rows of the polynomial 
encoder should first be algebraically manipulated (using Smith 
form and row reduction operations) to yield a left prime and 
row reduced encoder G{z). Then G{z) is called canonical 
in the literature, see |fT6l App. II]. A minimal trellis repre- 
sentation of C is then provided b^ the controller canonical 
realization G{z) = B{z^^I - Ay C + D as in Q. Although 
this result is known, in Appendix B we give a proof by 
showing that there exists an isomorphism between the trellis 
state set of the controller canonical realization and the trellis 
state set of the canonical trellis (as defined in Appendix A) of 
C. The set is thus minimal and has q"^ elements, where q is the 
number of elements of the field and v is the sum of the row 
degrees of G{z). The invariant i' is commonly referred to as 
the "degree" of the code C (but called the "overall constraint 
length" in the early literature). The row degrees are called the 
"Forney indices" of the code 120|. 

Below we consider convolutional codes over Zpi- that admit 
a noncatastrophic encoder, for simplicity, we call such codes 



noncatastrophic. We show that such codes admit a particular 
type of polynomial encoder (later called "minimal p-encoder"), 
whose controller canonical realization provides a minimal 
trellis representation, just as in the field case. We are then 
also able to express the minimal number of trellis states in 
terms of the sum of the row degrees of a minimal p-encoder. 

Let us now first introduce the notion of "p-encoder". Recall 
that Ap = {0,l,...,p-1} c Zpr. 

Definition III.l. Let C be a convolutional code of length n 
over Zpr. Let E{z) G Zp^^[z] be a polynomial matrix whose 
rows are a p-linearly independent p-generator sequence. Then 
E{z) is said to be a p-encoder for C if 

C = {ce \ 3ue {A;f : c = uE{z) and 

supp u C [N, oo) for some integer N}. 

The integer k is called the p-dimension of C. Furthermore, 
E{z) is said to be a delay-free p-encoder if for any G Z 
and any c G C, written as c = uE{z) with u G (.4^)^ we 
have 

supp c C [iV, oo) supp u C [iV, oo). 

Also, E{z) is said to be a noncatastroptiic p-encoder if for 
any c £ C, written as c = uE{z) with u G (Ap)^ we have 

supp c is finite supp u is finite. 

Finally, a convolutional code C that admits a noncatastrophic 
p-encoder is called noncatastrophic. 

Thus a difference between a p-encoder E{z) and the encoding 
matrix G{z) of ([T]), is that the inputs of E{z) take their values 
in Ap rather than in Z^r . Note that the idea of using a p-adic 
expansion for the input sequence is already present in the 1993 
paper Q. It was not until 1996 that the crucial notion of p- 
generator sequence appeared in fl^, but only for constant 
vectors — it was extended to polynomial vectors in 113|. In 
our definition the rows of a p-encoder are required to be a 
p-generator sequence consisting of polynomial vectors. 

Recall that a convolutional code over Zp.- is given by ([T]): 

C = {c G {T^rf I 3 u G {"L^. f : c = uG{z) and 

supp u C [N, oo) for some integer N}. 

Also recall that there exist convolutional codes over Zpr that 
do not admit a G{z) of full row rank, i.e. an encoder An 
important observation is that any convolutional code over Zpr 
admits a p-encoder, even a p-encoder E{z), such that the rows 
of E'-^'^ are p-linearly independent in ZpV . Indeed, any reduced 
p-basis of the polynomial module spanned by the rows of 
G{z), produces the rows of such a p-encoder E{z). This shows 
that the concept of p-encoder is more natural than the concept 
of encoder as it is tailored to the algebraic structure of Zpr. 
The next lemma is straightforward. 

Lemma III.2. Let E{z) G Zp^^"[2] be a p-encoder for a 
convolutional code C of length n. Then E(z) is delay-free 



property ( Definition UILli if and only if the rows of E(0) are 
p-linearly independent in Z^r. 

Theorem III.3. Let C be a convolutional code of length n over 
Zpr. Then C admits a delay-free p-encoder E{z) G Zpr"'[z] 
for some integer k, such that the rows of E^"^^ are p-linearly 
independent in Zpr. 

Proof: As noted above, C admits a p-encoder E{z), such 
that the rows of i?'''^ are p-linearly independent in Z^V, i.e., 
they constitute a reduced p-basis. Without loss of generality we 
may assume that the row degrees of E{z) are nonincreasing. 
Let L be the smallest nonnegative integer such that the last 
K — L rows of E{z) are a delay-free p-encoder. 

Now assume that L > (otherwise we are done). If L ^ k it 
means that the last row e„(z) of E{z) can be written as 

e^iz) = z^e^{z), 

where £ > and e„(2) G Z^r[z] with e«(0) 7^ 0. Note that 
degeK(z) < degeK(z). Clearly, (ei(z), . . . , e«_i(z), e«,(z)) 
is a p-encoder of C, whose rows are still a reduced p-basis. 

If L < K, then, by construction, there exist cnj G Ap for 
j = L + 1 , . . . , K, such that 

ei(0) + ^a,e,(0) = 

(use the fact that (ei(0), . . . , 6^(0)) is a p-generator 
sequence by Lemma III. 31 ). Replacing by 
eL{z) := eL(z) + J^jyL^J^ii^) obviously gives a 
p-basis (61(2:), . . . , ei_i(z), eL{z), eL+i(z), . . . , e^iz)) 
of the module spanned by ei(z), 61,(2), ... , 6^(2) 
and, consequently, a p-encoder of C. Moreover, 
by the p-predictable degree property (Theorem 3.8 
of IT3I ). degeL(z) — degeL(z), which means that 
(ei(z), 61,(2:), ... , eK{z)) is still a reduced p-basis. 
Since eL(0) — 0, we can write eL{z) = z^eL{z), 
with 61,(0) ^ and ^ > 0. Note that peL{z) is a 
p-linear combination peL{z) = ^j^i^ l3j{z)ej{z) with 
I3j{z) G .4p[2]. Because of the p-linear independence of 
6l+i(0), . . . , 6^(0), we must have that the coefficients 
(3j{z) are of the form /3j(z) = z^'j3j{z) with Ij > I 
for L + 1 < j < K. Consequently, the sequence 
(6i(z), . . . ,6L_i(z),eL(z),eL+i(z), . . . ,e«(z)) is a p- 
encoderof C, which is still a reduced p-basis withdegei(z) < 
deg6L(z). If (6i(z), . . . ,eL(z),eL+i(z), . . . ,e„(z)) is not a 
delay-free p-encoder, then re-order the vectors so that their 
degrees are nonincreasing and repeat this procedure until a 
delay-free p-encoder for C is obtained. Since the sum of the 
row degrees of p-bases obtained at each step of the procedure 
is lower than in the previous step, a delay-free p-encoder is 
obtained after finitely many iterations. ■ 
The next example is a simple example that illustrates the 
above theorem. 

Example III.4. Over Z4: consider the (2,1) convolutional 
code C of [16 p. 1668] given by the polynomial encoder 

G{z) = [2 2 + z]. 



A delay-free p-encoder for C is given by 

E{z) = 



2 2 + z 
2 



Theorem III.5. Let C be a noncatastrophic convolutional 
code of length n over Zj,r. Then C admits a delay-free 
noncatastrophic p-encoder E{z) £ Zpr^"[z] for some integer 
K, such that the rows of E^'^'^ are p-linearly independent in 

Proof: By definition there exists a noncatastrophic p- 
encoder Ei (z) for C. Apply Algorithm 3.11 of ifTSll to the rows 
of Ei{z). This gives us a reduced p-basis 61(2;), . . . , e^{z) for 
the module spanned by the rows of Ei{z). Define E2(z) as 
the K X n polynomial matrix with ei(z), . . . , e„(2) as rows. 
By construction the rows of Elf^ are p-linearly independent 
in ZpV. It is easy to see that E2{z) is still noncatastrophic. If 
E2{z) is not delay-free apply the procedure of the proof of 
Theorem lIII.3l to E2{z) to obtain a delay-free p-encoder E{z), 
such that the rows of E''^'^ are p-linearly independent in Z^r- . 
It is easy to see that E{z) is still noncatastrophic. ■ 

Definition III.6. Let C be a noncatastrophic convolutional 
code of length n over Zp,-. Let E{z) e Zpr^"[z] be a delay- 
free noncatastrophic p-encoder for C, such that the rows of 
j^irc ^j-g p.jijiearly independent in Z^^. Then E{z) is called 
a minimal p-encoder of C. Furthermore, the p-indices of C 
are defined as the row degrees of E{z) and the p-degree of 
C is defined as the sum of the p-indices of C. 

Thus, in the terminology of section |lll the rows of a minimal 
p-encoder are a reduced p-basis. If the code C has a canonical 
encoder G{z), then both G'*"^ mod p and G(0) mod p have 
full row rank in Z^^*", so that a minimal p-encoder is trivially 
constructed as 

G{z) 
pG{z) 



E{z) = 



p'^-^Giz) 



(4) 



An important observation is that all noncatastrophic codes 
admit a minimal p-encoder E{z) but not all such codes admit 
an encoder G{z) that is row reduced and/or delay-free. 

Definition III.7. Let C be a convolutional code of length n 
with p-encoder E(z) G Zp^^[z]. Denote the sum of the row 
degrees of E{z) by 7 and let 

be a controller canonical realization of E{z). Then the con- 
troller canonical trellis corresponding to E(z) is defined as 

X = {Xt}tez, where Xt = {Z;.,A]„Kt) with 

Kt = {{s{t),s{t)C + u{t)D, s{t)A + u{t)B such that 

s{t)eA;, u{t)eA;}. 

Note that the states take their values in the nonlinear set 
A^, which is not closed with respect to addition or scalar 
multiplication. Similarly, the inputs take their values in the 
nonlinear set Ap. The next theorem presents our main result. 



Theorem III.8. Let C be a noncatastrophic convolutional code 
of length n with minimal p-encoder E{z) G Zpr"[z]. Denote 
the p-degree of C by 7. Then the controller canonical trellis 
corresponding to E{z) is a minimal trellis representation for 
C. In particular, the minimum number of trellis states equals 
p\ 

Proof: see Appendix B. ■ 

In the field case r ~ \ the above theorem coincides with 
the classical result, i.e., the minimum number of trellis states 
equals p'* , where 7 is the degree of the code. 

For convolutional codes that admit a canonical encoder, we 
have the following corollary, which follows immediately from 
applying Theorem llll.Sl to the minimal p-encoder given by (|4|. 
Note that the result coincides with results in |26 Sect. 7.4], 
where a canonical encoder is called "minimal-basic". 

Corollary III.9. Let C be a (n, k) convolutional code that has 
a canonical encoder G{z) g Zp^"[z]. Then the rk p-indices 
of C are the k row degrees of G{z), each occurring r times. 
The minimum number of trellis states equals q^, where v is 
the sum of the row degrees of G{z) and where q — p"". 

The next example illustrates our theory for the more inter- 
esting case where the code does not admit a canonical encoder. 

Example III.IO. Over Z4: consider the (3, 2) convolutional 
code C given by the polynomial encoder 



G(z) 



9i{z) 
52 (z) 



where 



gi(z)=[z2 + l 1 ] and ,g2(z) = [ 2z 2 l]. 

Clearly, G(z) is a left prime encoder whose controller canon- 
ical trellis has 4^ = 64 trellis states. Note that G''''^ does not 
have full row rank and therefore G(z) is not canonical. Denote 
by im G(z) the polynomial module spanned by the rows of 
G(z). A p-basis for the module im G(z) is provided by the 
rows of the matrix 



9i{z) 




" z^ + 1 10' 


251 (^) 




2z2 + 2 2 


52 (z) 




2z 2 1 


. 2.92 (z) _ 




2 



which has leading row coefficient matrix 

1 

2 
2 
2 

The row reduction algorithm of fTT, Algorithm 3.11] is 
particularly simple in this case: by adding z times the third 
row to the second row, we obtain the matrix E{z), given by 



Eiz) 



z^ + 1 



1 

2 2z + 2 z 

2z 2 1 

2 



whose rows are a reduced p-basis for the module im G(z). 
Indeed, the rows of its leading row coefficient matrix, given 
by 

"10 
2 1 
2 
2 

are p-linearly independent. As a result, the p-indices of C are 2, 
1, 1, and the p-degree of C equals 4. The controller canonical 
trellis corresponding to E{z) is given by 



A 



C 





1 
2 1 

2 



B = 



D 



1 1 

2 2 
2 1 
2 



This trellis is minimal with 2 = 16 trellis states. 

Example III.ll. Over Z4: consider the (2, 1) convolutional 
code C of Example IIII.4I given by the polynomial encoder 
G(z) = [2 2 + z] (note that G{z) is not delay-free). The 
delay-free p-encoder 



E{z) 



2 2 + z 
2 



of Example IIII.4I is clearly minimal, so that its corresponding 
trellis is minimal with 2 states which concurs with |6|. 

IV. Conclusions 

An important class of polynomial encoders for convo- 
lutional codes over a field are the canonical ones. Their 
feedforward shift register implementations are minimal trellis 
representations of the code. The trellis state space is linear 
However, for convolutional codes over the finite ring Zpr, 
the literature has generalized this result only for restricted 
cases. In this paper we introduce the concept of p-encoder 
and define minimal p-encoder for the class of noncatastrophic 
convolutional codes. We show how to obtain a minimal p- 
encoder from a polynomial encoding of the code. We show 
that the feedforward shift register implementation of such a 
minimal p-encoder is a minimal trellis representation of the 
code. Its trellis state space is nonlinear We also express the 
minimal number of states in terms of the row degrees of the 
minimal p-encoder. In our view a minimal p-encoder is the 
ring analogon of the "canonical polynomial encoder" from 
the field case. We also present the novel concepts of p-indices 
and p-degree of a code as analogons of the field notions of 
"Forney indices" and "degree", respectively. 

Our approach allows us to view "delay-freeness" as a property 
of the p-encoder Thus we arrive at the novel result that delay- 
freeness is a property of the encoding (just as in the field 
case) rather than a property of the code, as in the literature 
so far (see e.g. ||4i subsect. V-C]). We conjecture that a 
similar phenomenon occurs with respect to catastrophicity, i.e.. 



"noncatastrophic" is a property of the p-encoder, not the code. 
This would imply that minimal p-encoders can be obtained for 
all convolutional codes over Zpr, including the catastrophic 
codes. This is of particular importance for rotationally invari- 
ant catastrophic codes, see e.g. flSl . It is a topic of future 
research to investigate this conjecture which is likely to involve 
a generalization of a type of "normal form" for polynomial 
matrices over Zpr. To support our conjecture, let us examine 
the rotationally invariant catastrophic code Ci over Z4 given 
by the encoder Gi{z) = [3 + 3z + 3 + z + z^] . A 

noncatastrophic minimal p-encoder for Ci is given by 

3 + 3z + 3z2 3 + z + z2 
2 2 



E^{z) 



yielding a minimal trellis representation of Ci with 4 states. 
Similarly the catastrophic code C2 over Z4 with encoder 
6*2(2) = [1 + z 1 + 3z] has a noncatastrophic minimal p- 
encoder 

, -s I + Z 1 + 3Z 

E2{z) - ^ 2 2 
yielding a minimal trellis representation of C2 with 2 states. 
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Appendix A 

In this appendix we recall the construction of a minimal 
trellis for a convolutional code C as a so-called two-sided 
realization of C, see US], ID, ||2T], ([Bl, HH, lUg). Consider 
two code sequences c G C and c G C. Conform ll25l . the 
concatenation at time i G Z of c and c, denoted by c f\t c, is 
defined as 

for t' < t 
for t' > t ■ 



The code sequences c and c are called equivalent, denoted by 
c ~ c, if 

c Ao c G C. 

Definition A.l. Let C be a linear convolutional code of length 
n over a finite ring TZ. The canonical trellis of C is defined as 
X = {Xjtez, where Xt = (7^", Kt) with S := C mod ~ 
and 

Kt {(s(t), c{t),s{t + 1)) I s{t) = z"*c mod ~ and 

s{t + 1) = z"*"^c mod ~}. 

It has been shown in |25| that the above trellis is minimal. 
Intuitively this is explained from the fact that, by construction, 
states cannot be merged. 



Appendix B 

In this appendix we prove Theorem IIII.8I via a bijective 
mapping from the controller canonical trellis state set to the 
trellis state set of the canonical trellis that is defined in 
Appendix A. We first provide the proof for the field case. 
In our proof of Theorem IIII.8I which is the ring case, we are 
then able to highlight the parts that are different from the proof 
for the field case. 

Theorem B.l. Let C be a {n, k) convolutional code of degree 
V over a finite field TZ with canonical encoder G{z) G 
^fexnj-^j fi^gfj fi2g controller canonical trellis corresponding 
to G{z) is a minimal trellis representation for C. In particular, 
the minimum number of trellis states equals q'^ , where q is the 
size of the field TZ. 

Proof: Denote the memory of C by v^,, i.e., v^, is the 
maximal Forney index of C. Consider the mapping 6 : TZ'^ ^ 
C mod ~, given by 

0(s) [cU 

where c S C passes through state s at time 0. The mapping 
Q is well-defined since for any s there exists such a code 
sequence and any two code sequences that pass through state 
s at time are obviously equivalent. 

Since the trellis state set C mod ~ of the canonical trellis 
of Appendix A is minimal, it suffices to prove that <d is 
an isomorphism, as follows. Surjectivity follows immediately 
from the fact that all code sequences pass through some 
state at time 0. Furthermore, the mapping Q is linear since 
0(si + S2) = [ci +C2]^. It remains to prove that 9 is 
injective. 

For this, let s e TZ" be such that e(s) = 0. Define 
m(— 2), u(— 1) as elements of TZ'^ for which 



u{-2) u{-l) 



BA 
B 



Define «:=(■•• , 0, 0, • • • , u(-2), 0, 0, • • • ) 

and let c := G{z)u be the corresponding code sequence. Then 
clearly c passes through s. From Q{s) — Q it now follows that 
the sequence cAqO is a code sequence. Denote its state at time 
by s' and its input sequence by u' . Then clearly 



/(-2) u'(-l) 



BA 
B 



We now prove that s = s', as follows. Firstly, it is clear that 

[ c(-z.,) ••■ c(-2) c(-l) ] = 



Furthermore, from the fact that the encoder is delay-free 
(Property 1 in section IJi it follows that D — G(0) has full 
row rank and that u'{£) — for £ < —v^,. As a result. 



[ ••• u(-2) u{-l) 



D BC BAG ■■ 
D BG ■■ 
D ■■ 



(5)- 



c(-2) c(-l) ] 



u'{~2) ] 



D BC BAG 
Q D BC 
D 



(6) 

Since D has full row rank, the matrix in the above equation 
also has full row rank. Since the right-hand sides of equa- 
tions (|5]l and (|6]l are equal, it then follows that u{t} ~ u'{£) 
for —I/, < £ < —1. As a result s = s'. 

We now prove that s = 0. By the above, c Ao is a code 
sequence that passes through s at time 0. Its input sequence 
u' is of the form 

(••• ,0,0,u'(-z.,),--- ,u'(A/),0,0,---), 

where M > 0. Here we used the fact that the encoder is 
noncatastrophic (Property 2 in section By construction the 
state of c Ao at time AI + i^* + 1 then equals zero. We now 
use the row reducedness of G{z) to conclude that s = 0, as 
follows. Denote the state at time M + by s. Now recall the 
formula (|3]l for the controller canonical form. Since sA ~ 0, 
the nonzero components of s must be last components in a 
(1 X i^i)-block in s. Also, c(Af + v^) = 0, so that sG = 0. 
By construction, the last rows of the (yi x n) -blocks of G are 
rows from G''"^ and are therefore linearly independent. As a 
result, s = 0. Repeating this argument again and again, we 
conclude that u'(0) = ... ~ u'{M) ~ and all states for time 
> are zero, so that, in particular s ~ 0, which proves the 
theorem. Obviously, the size of the trelhs state set S equals 
(j". ■ 
We now turn to the ring case to prove the analogon of 
the above theorem. As compared to the field case, the proof 
requires some care because the trellis state set is not linear 

Proof of Theorem lIlLSl 

Define i/^, as the maximal p-index of C. Consider the mapping 
given by 



e:A],^G mod 



e(s) := [cl 



where c G C passes through state s at time 0. Then 8 can 
be shown to be well-defined and surjective, as in the proof of 
Theorem lB.il Note that Q is not necessarily a linear mapping. 
As a result, injectivity can no longer be proven by showing 
that 0(s) = only for s = 0, as in the proof of Theorem lB.il 
Thus, to show that 8 is injective, let s and s e be such that 
8(s) = 8(s). Let c be the code sequence that passes through 
s at time 0, as defined in the proof of Theorem IB. II Let c be 
the analogous code sequence that passes through s at time 0. 
Note that both c and c have finite support. From 8(s) = 8(s) 
it now follows that the sequence c Ao c is a code sequence. 
Denote its state at time by s' and its input sequence by 
u' G {Ap)^. Since E{z) is a delay-free p-encoder, the rows 
of -^(O) are a p-basis (use also Lemma III. 3b . By Lemma 2.8 
of lfT3l (see also ll24l "). it now follows from the fact that inputs 



only take values in Ap that s — s'. The reasoning is as in the 
proof of Theorem IB. II 

We now prove that s = s. By the above, c Aq c is a code 
sequence that passes through s at time 0. As in the proof 
of Theorem IB. II it follows that its state equals zero at time 
M + ly^, + 1 for some M > 0. Since E{z) is a minimal p- 
encoder, the rows of E^'^'^ are p-linearly independent. It now 
follows from the fact that states only take values in Ap that 
the state at time M + i^, must also be zero. The reasoning 
is as in the proof of Theorem IB. II Repeating this argument 
again and again, we conclude that all states for time > are 
zero. As a result, u'{Q) — u'{l) = ■ ■ ■ = u'{v^ — 1) = 0, so 
that 

s[C AC ••• A'^'-^C ] 

= I[ C AC ••• A^-'^C ] . 

We now prove that the above equation implies that s = s. 
By Theorem 3.10 of |13|, the rows of i?'*"^ are not only 
p-linearly independent but also a p-generator sequence. By 
Lemma 2.8 of [13J any p-linear combination of these rows 
is then unique. By construction, this property is inherited by 
the rows of [ C AC ■ ■ ■ A'^'-'^C ] . Since both s and s 
take their values in Ap, it therefore follows that s — s, which 
proves the theorem. Obviously, the size of the trellis state set 
S equals . □ 
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